Look-ahead techniques for fast beam search
Authors
Abstract
In this paper, we present two efficient look-ahead pruning techniques in beam search for large vocabulary continuous speech recognition. Both techniques, the language model look-ahead and the phoneme look-ahead, are incorporated into the word-conditioned search algorithm using a bigram language model and a lexical prefix tree [5]. The paper presents the following novel contributions: We describe a method for language model (LM) look-ahead pruning which is similar to [1, 9]. We show special techniques to reduce the memory and computational requirements. These techniques are based on a compressed LM look-ahead tree. To compute the LM look-ahead tree probabilities efficiently, we present a backward dynamic programming scheme. We present a phoneme look-ahead pruning technique to increase the efficiency of the acoustic pruning. In particular, we refine the acoustic pruning strategy by a 1-phoneme and a 2-phoneme look-ahead, respectively. We report results for both look-ahead pruning methods on the 20,000-word North American Business (NAB’94) task. The combination of bigram look-ahead and 1-phoneme look-ahead reduces the search space by a factor of 10 without loss in recognition accuracy, compared with the baseline search using a unigram language model look-ahead as described in [2]. The computational cost is reduced by a factor of 5 on an SGI workstation (Indy R4400).
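The backward dynamic programming idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes that the look-ahead value of a tree node, for a fixed predecessor word, is the maximum bigram probability over all words reachable from that node, and that it is filled in from the leaves toward the root. The names `Node`, `build_tree`, and `compute_lookahead` and the toy lexicon are hypothetical, and the tree compression described in the abstract is not shown; this is not the paper's implementation.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node (arc end) of a lexical prefix tree. Hypothetical structure."""
    children: dict = field(default_factory=dict)  # phoneme -> Node
    words: list = field(default_factory=list)     # words ending at this node
    lookahead: float = 0.0                        # best bigram prob. below this node


def build_tree(lexicon):
    """Build a prefix tree from a mapping word -> list of phonemes."""
    root = Node()
    for word, phonemes in lexicon.items():
        node = root
        for ph in phonemes:
            node = node.children.setdefault(ph, Node())
        node.words.append(word)
    return root


def nodes_parents_first(root):
    """Return all nodes so that every parent precedes its children."""
    order, stack = [], [root]
    while stack:
        node = stack.pop()
        order.append(node)
        stack.extend(node.children.values())
    return order


def compute_lookahead(root, bigram, predecessor):
    """Backward pass: visit children before parents and propagate the maximum.

    `bigram` maps (predecessor, word) -> probability; missing pairs count as 0.0
    (an assumption made for this sketch; a real system would back off).
    """
    for node in reversed(nodes_parents_first(root)):
        best = max(
            (bigram.get((predecessor, w), 0.0) for w in node.words),
            default=0.0,
        )
        for child in node.children.values():
            best = max(best, child.lookahead)
        node.lookahead = best


# Toy data, invented for illustration only.
lexicon = {"cat": ["k", "ae", "t"], "cab": ["k", "ae", "b"], "dog": ["d", "ao", "g"]}
bigram = {("the", "cat"): 0.5, ("the", "cab"): 0.125, ("the", "dog"): 0.25}

root = build_tree(lexicon)
compute_lookahead(root, bigram, "the")
```

During search, a hypothesis sitting at a given node could then be scored with `node.lookahead` before the word identity is known, so that branches leading only to unlikely words can be pruned early. Each node is visited once per predecessor word, which is why reducing the number of nodes (for example by compressing the tree) lowers both memory and computation.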
Related resources
Look-ahead Techniques for Improved Beam Search
This paper presents two look-ahead techniques for large vocabulary continuous speech recognition. These two techniques, which are referred to as language model look-ahead and phoneme look-ahead, are incorporated into the pruning process of the time-synchronous one-pass beam search algorithm. The search algorithm is based on a tree-organized pronunciation lexicon in connection with a bigram lang...
Reducing time-synchronous beam search effort using stage based look-ahead and language model rank based pruning
In this paper, we present an efficient look-ahead technique, based on both the Language Model (LM) Look-Ahead and the Acoustic Model (AM) Look-Ahead, for time-synchronous beam search in large vocabulary speech recognition. In this so-called stage-based look-ahead (SLA) technique, two predicting processes with different hypothesis evaluating criteria are organized by stages according to the...
An augmented beam-search based algorithm for the strip packing problem
In this paper, the use of beam search and look-ahead strategies for solving the strip packing problem (SPP) is investigated. Given a strip of fixed width W, unlimited length L, and a set of n circular pieces of known radii, the objective is to determine the minimum length of the initial strip that packs all the pieces. An augmented algorithm which combines beam search and a look-ahead strategie...
Improved lexical tree search for large vocabulary speech recognition
This paper describes some extensions to the language model (LM) look-ahead pruning approach which is integrated into the time-synchronous beam search algorithm. The search algorithm is based on a lexical prefix tree in combination with a word-conditioned dynamic search space organization for handling trigram language models in a one-pass strategy. In particular, we study several LM look-ahead pr...
IDA*_MCSP: a fast exact MCSP algorithm
QoS routing has been shown to be NP-hard. A recent study of its hardness suggests that the “worst-case” may not occur in practice and thus there may exist a fast exact algorithm. In this paper, we deploy the idea of iterative deepening search and look ahead to design an exact algorithm for finding the shortest path subject to multiple constraints (the MCSP problem). The accuracy of look-ahead i...
Journal:
- Computer Speech & Language
Volume 14
Publication date: 1997